BIOTECHNOLOGY afl ar-S 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 



Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: Datin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 

■ v / 
Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, th&s saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://^w^.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary 



ERROR DETECTED 

ATTN: NEW RULES CASES: 

1 Wrapped Nucleics 

Wrapped Aminos 

2 Invalid Line Length 

3 Misaligned Amino 

Numbering 

4 Non-ASCII 



11 



12 



Variable Length. 



Patenlln 2.0 
"bug" 



Skipped Sequences 



(OLD RULES) 



(NEW RULES) 



9 Use ofn's or Xaa's 

sTEW RULES) 




Jnvalid <213> 
Response 

Use of<220> 



Patentln 2.0 
"bug" 



SUGG ESTED CORRECTION SERIAL NUMBER: 

PLEASE DISREGARD ENGLISH 44 ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent 'Swapping." 

The rules require that a line not eiceed 72 characters in length, this includes white spaces. 

The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead 

The submitted file was not saved in ASCH(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission Is saved In ASCII text. 

Sequences) contain n*s or Xaa's representing more than, one residue. Per Sequence Rules, 

each norXaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2,0 has caused die <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would* automatically, generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequence^) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where T is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to Include the skipped sequences. 



8 Skipped Sequences ' . Sequence(s)_ 



missing. If Intentional, please insert the following lines for each skipped sequence. 



<2 1 0> sequence id number 
<400> sequence id number 
000 

Use of n*s and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific'name (Genus/species). <220>-<223> section is required when <2 1 3> response is Unknown or 
is Artificial Sequence 



Sequences) _ 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unlcnown " Please explain source of genetic material in <220> to <223> section. 
(See "Federal Regrster," 06/0 1/1998, Vol. 63. No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 . Misuse of n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 



AMC/MH - Biotechnology Systems Branch - 08/21/2001 
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OIPE 



RAW SEQUENCE LISTING DATE: 09/20/2001 

PATENT APPLICATION: US/09/943,531 TIME: 13:40:18 ^ 

Input Set : A:\GG119-2US.ST25.txt >/* 
Output Set: N:\CRF3\09202001\I943531.raw I 

Does Not Comply 

3 <no> applicant: Risinger, cari Corrected Diskette Needed 

4 Andersson, Maria K. 

5 Lewander, Tommy K. 

6 Olaisson, Erik K. 

8 <120> TITLE OF INVENTION: Detection of CYP2C19 Polymorphisms 

10 <130> FILE REFERENCE: GG119.2US 
> 12 <140> CURRENT APPLICATION NUMBER: US/09/94 3 f 531 

12 <141> CURRENT FILING DATE : 2001-08-30 

12 <150> PRIOR APPLICATION NUMBER: GB 0021286.0 

13 <151> PRIOR FILING DATE: 2000-08-30 
15 <160> NUMBER OF SEQ ID NOS : 37 
17 <170> SOFTWARE: Patentln version 3.1 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 1239 

21 <212> TYPE: DNA 

22 <213> ORGANISM: homo sapiens 

24 <400> SEQUENCE: 1 

25 aaaatcaata taaagcagcc atgtctggag gagaccagga ggtcaagaag ccttagtttc 60 
27 tcaagccctt agcaccaaat tctctgagat cagctcttcc ttcagttaca ctgagcgttt 120 
29 cccctctgca gtgatggaga agggagaact cttatttttt ctcatgagca tctctggggc 180 
31 tgttttcctt agataaataa gtggttctat ttaatgtgaa gcctgtttta tgaacaggat 24 0 
33 gaatgtggta tatattcaga ataactaakg tttggaagtt gttttgtttt gctaaaacaa 300 
35 agttttagca aacgattttt tttttcaaat ttgtgtcttc tgttctcaaa gyatctctga 360 
37 tgtaagagat aatgcgccac gatgggcatc agaagacctc agctcaaatc ccagttctgc 420 
39 cagctatgag ctgtgtggca ccaacaggtg tcctgttctc ccagggtctc ccttttccca 480 
41 tttgaaatat aaaaaataac aattcctgcc ttcacgtgtt tttttagggg gttaaatggt 540 
43 aaaggtgttt atatctgcta aggtaattta cttgatatat gtttggttat tgaagatata 600 
45 tgagttatgt tagctatttc atgtttaggc tgctgtattt ttagtaggct atattaaata 660 
47 gaggatttca ttataaagga caaagtctcc taatcttcga tataggattg acatactttt 720 
49 taaatataca aggcatagaa tatggccatt tccgttaaat cataaattcc caactggtta 780 
51 ttaatctaag aattcagaat tttaagtaat tgtttttgca tcagattgtt tacttcagtg 84 0 
53 ctctcaatta tgacggtgca ttggaaccac ttgggttaac atttttttgt ttttattacc 900 
55 aatacctagg cttcaaccta gtacaatgaa accagaatgt acagagtggg cactgggacg 960 
57 aaggagaaca agaccaaagg acattttatt tttatctcta tcagtgggtc aaagtccttt 1020 
59 cagaaggagc atatagtggg cctaggtgat tggccactty atccatcaaa gaggcacaca 1080 
61 cacttaatta gcatggagtg ttataaaaag cttggagtgc aagctcacgg ttgtcttaac 1140 
63 aagaggagaa ggcttcaatg gatccttttg tggtccttgt gctctgtctc tcatgtttgc 1200 
65 ttctcctttc aatctggaga cagagctctg ggagaggaa 123 9 

68 <210> SEQ ID NO: 2 

69 <211> LENGTH: 11 / I 0 . 

70 <212> TYPE: DNA^— ^ J i ^ J^^Mm A^T 

71 <213> ORGANISM: ^synthetic jj&Jol /M*^ < ^ v/x v ^ < 

73 <400> SEQUENCE: 2 ^ ^ 

74 actaatgttt g 11 

11 <210> SEQ ID NO: 3 

78 <211> LENGTH: 11 

79 <212> TYPE: DNA 
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RAW SEQUENCE LISTING DATE: 09/20/2001 

PATENT APPLICATION: US/09/943,531 TIME: 13:40:18 

Input Set : A:\GG119-2US.ST25.txt 
Output Set: N:\CRF3\09202001\I943531.raw 

80 <213> ORGANISM :(s} 

82 <400> SEQUENCE: 

83 caaagcatct c 11 

86 <210> SEQ ID NO: 4 

87 <211> LENGTH: 11 

88 <212> TYPE: DNA ^^—^^^ 

89 <213> ORGANISM ^^n.tf^tic) 

91 <400> SEQUENCE: 4^- —*~""^ 

92 cactttatcc a 11 

95 <210> SEQ ID NO: 5 

96 <211> LENGTH: 11 

97 <212> TYPE: DNA 

98 <213> ORGANISM ^synthetic 

100 <400> SEQUENCE: 

101 actaaggttt g 11 

104 <210> SEQ ID NO: 6 

105 <211> LENGTH: 11 

106 <212> TYPE: DNA 

107 <213> ORGANISM<r^sySthetic 

109 <400> SEQUENCE:6 ' 

110 caaagtatct c 11 

113 <210> SEQ ID NO: 7 

114 <211> LENGTH: 11 

115 <212> TYPE: DNA^—^--- 

116 <213> ORGANIS^f synthetic/ 

118 <400> SEQUENCEV^Z. . — ^ 

119 cacttcatcc a 11 

122 <210> SEQ ID NO: 8 

123 <211> LENGTH: 22 

124 <212> TYPE: DNA^- — > ^ x 

125 <213> ORGANISl<r"synthetic^ 
127 <400> SEQUENCE:^- 

12 8 caggaggtca agaagcctta gt 22 

131 <210> SEQ ID NO: 9 

132 <211> LENGTH: 19 

133 <212> TYPE: DNA^ 

134 <213> ORGANS 

136 <400> SEQUENCE: 

137 ccatcgtggc gcattatct 19 

140 <210> SEQ ID NO: 10 

141 <211> LENGTH: 20 

142 <212> TYPE: DNA. 



DNA^- — > ^ -v. 

3^: synthetic ) 

:E>-a___^-^ 



Z <^Z _LZ-^ J. 1 : JJINA^— . ^ >. 

143 <213> ORGANIST: synthetic/ 
14 5 <400> SEQUENC&4__1£L_-- — ^ 



<400> SEQUENC 

146 acggtgcatt ggaaccactt 20 

149 <210> SEQ ID NO: 11 

150 <211> LENGTH: 21 

151 <212> TYPE: DNA 

152 <213> ORGANISM: synthetic 
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RAW SEQUENCE LISTING DATE: 09/20/2001 

PATENT APPLICATION: US/09/94 3,531 TIME: 13:40:18 

Input Set : A:\GG119-2US.ST25.txt 
Output Set: N:\CRF3\09202001\I943531.raw 

154 <400> SEQUENCE: 11 

155 cccagagctc tgtctccaga t 21 

158 <210> SEQ ID NO: 12 

159 <211> LENGTH: 17 

160 <212> TYPE: DNA. 



161 <213> ORGAN I SM^Ls^ ri€he t I c^ 

163 <400> SEQUENCE: 12 

164 agtgggcact gggacga 17 

167 <210> SEQ ID NO: 13 

168 <211> LENGTH: 20 

169 <212> TYPE: DNA V 



j_oy <a±z> ±ir&: una^~ 
170 <213> ORGAN I SM (^syptlie\ic^ 



172 <400> SEQUENCE: TT 

173 gatccattga agccttctcc 20 

176 <210> SEQ ID NO: 14 

177 <211> LENGTH: 23 

178 <212> TYPE: DNA 



179 <213> ORGANISM ^^nthe^ic^^ 
181 <400> SEQUENCE: T^——- — 



SEQUENCE : 

182 gtaattgttt ttgcatcaga ttg 23 

185 <210> SEQ ID NO: 15 

186 <211> LENGTH: 23 

187 <212> TYPE: DNA^ — 

188 <213> ORGANISJ^synth^tic^) 

190 <400> SEQUENCE:T5 

191 tccatgctaa ttaagtgtgt gtg 23 

194 <210> SEQ ID NO: 16 

195 <211> LENGTH: 22 

196 <212> TYPE: DNA 

197 <213> ORGANISM^ synthetic 

199 <400> SEQUENCE: 

200 ctgagatcag ctcttccttc ag 22 

203 <210> SEQ ID NO: 17 

204 <211> LENGTH: 24 

205 <212> TYPE: DNA 

206 <213> ORGANISM^ syrithe~tic - 

208 <400> SEQUENCE: 

209 aggcaggaat tgttattttt tata 24 

212 <210> SEQ ID NO: 18 

213 <211> LENGTH: 20 

214 <212> TYPE: DNA 

215 <213> ORGANIS^T^s^th'etic) 

217 <400> SEQUENCE>-4£-_-~-^^ 

218 tggggctgtt ttccttagat 20 

221 <210> SEQ ID NO: 19^ 

222 <211> LENGTH: 22 

223 <212> TYPE: DNA 

224 <213> ORGANISM^ synthetic 
226 <400> SEQUENCE^ 
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RAW SEQUENCE LISTING DATE: 09/20/2001 

PATENT APPLICATION: US/09/943,531 TIME: 13:40:18 

Input Set : A:\GG119-2US.ST25.txt 
Output Set: N:\CRF3\09202001\I943531.raw 

227 atttaacccc ctaaaaaaac ac 22 

230 <210> SEQ ID NO: 20 

231 <211> LENGTH: 11 

232 <212> TYPE: DNA 

233 <213> ORGANISM :<^yn^ieticj^ 

235 <400> SEQUENCE: 20 

236 caaacattag t 11 

239 <210> SEQ ID NO: 21 

240 <211> LENGTH: 11 

241 <212> TYPE: DNA 



242 <213> ORGANISM^^yrft^ticT^ 
244 <400> SEQUENCE :2± 



245 caaaccttag t 11 

248 <210> SEQ ID NO: 22 

249 <211> LENGTH: 11 

250 <212> TYPE: DNA 

251 <213> ORGANISM : < syn€Ketic ^> 

253 <400> SEQUENCE: 22 

254 gagatgcttt g 11 

257 <210> SEQ ID NO: 23 

258 <211> LENGTH: 11 

259 <212> TYPE: DNA 



260 <213> ORGANISM: (^rftheticj^) 
262 <400> SEQUENCE : 23 — ^ 



263 gagatacttt g 11 

266 <210> SEQ ID NO: 24 

267 <211> LENGTH: 11 

268 <212> TYPE: DNA^^^_^ 

269 <213> ORGANIS^C ^nthet i^ 

271 <400> SEQUENCE: 24 

272 tggataaagt g 11 

275 <210> SEQ ID NO: 25 

276 <211> LENGTH: 11 

277 <212> TYPE: DNA 



278 <213> ORGANISMS synthetic 



280 <400> SEQUENCES-^! 



281 tggatgaagt g 11 

284 <210> SEQ ID NO: 26 

285 <211> LENGTH: 11 

286 <212> TYPE: DNA- — ^ ^ 

287 <213> ORGANISM N^xathetic^> 

289 <400> SEQUENCE: 26 

290 tcagaataac t 11 

293 <210> SEQ ID NO: 27 

294 <211> LENGTH: 11 

295 <212> TYPE: DNA ^ 

296 <213> ORGANISM ^synthe tjjer 

298 <400> SEQUENCE:27 

299 tctgttctca a 11 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/943 f 531 



DATE: 09/20/2001 
TIME: 13:40:18 



Input Set : A:\GG119-2US.ST25.txt 
Output Set: N:\CRF3\09202001\I943531.raw 




30 




302 <210> SEQ ID NO: 28 

303 <211> LENGTH: 11 

304 <212> TYPE: DNA 

305 <213> ORGANISM :Csynthetic^ 

307 <400> SEQUENCE: 28 ^ 

308 tgattggcca c 

311 <210> SEQ ID NO: 29 

312 <211> LENGTH: 11 

313 <212> TYPE: DN2 

314 <213> ORGANISM( 

316 <400> SEQUENCE: 

317 agttattctg a 

320 <210> SEQ ID NO: 

321 <211> LENGTH: 11 

322 <212> TYPE: DNA, 

323 <213> ORGANISM ^S^ithet 

325 <400> SEQUENCE: 30 

326 ttgagaacag a 

329 <210> SEQ ID NO: 31 

330 <211> LENGTH: 11 

331 <212> TYPE: DN2 

332 <213> ORGANISM^, 
3 34 <400> SEQUENCE: 
335 gtggccaatc a 

338 <210> SEQ ID NO: 

339 <211> LENGTH: 10 

340 <212> TYPE: DNA 

341 <213> ORGANISM: (^synthetic 

343 <400> SEQUENCE: 3^ 

344 acttccaaac 

347 <210> SEQ ID NO: 33 

348 <211> LENGTH: 11 

349 <212> TYPE: DN/ 

350 <213> ORGANISM: 

352 <400> SEQUENCE: 

353 acatcagaga t 

356 <210> SEQ ID NO: 

357 <211> LENGTH: 11 

358 <212> TYPE: DNA^ 

359 <213> ORGANISM^ synthetic 

361 <400> SEQUENCE: 

362 ctttgatgga t 

365 <210> SEQ ID NO: 35 

366 <211> LENGTH: 10 

367 <212> TYPE: DNA, 

368 <213> ORGANISM ^synthet 

370 <400> SEQUENCE: 3 5 

371 gtttggaagt 
374 <210> SEQ ID NO: 36 



11 



11 



11 



11 



32 



10 




11 



34 



11 



10 



The types oF errors shown exist throughout 
the Sequence Listing. Please check subsequent 
sequences for siaiilar exrors. 
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VERIFICATION SUMMARY DATE: 09/20/2001 

PATENT APPLICATION: US/09/943, 531 TIME: 13:40:19 

Input Set : A:\GG119-2US.ST25.txt 
Output Set: N:\CRF3\09202001\I943531.raw 

L:12 M:270 C: Current Application Number differs, Replaced Current Application No 
L:12 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
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